Automatically Extracting Class Diagrams from Spreadsheets

Authors

  • Felienne Hermans
  • Martin Pinzger
  • Arie van Deursen

Abstract

The use of spreadsheets to capture information is widespread in industry. Spreadsheets can thus be a rich source of domain information. We propose to automatically extract this information and transform it into class diagrams. The resulting class diagram can be used by software engineers to understand, refine, or re-implement the spreadsheet’s functionality. To enable the transformation into class diagrams, we create a library of common spreadsheet usage patterns. These patterns are located in the spreadsheet using a two-dimensional parsing algorithm. The resulting parse tree is transformed and enriched with information from the library. We evaluate our approach on the spreadsheets from the EUSES Spreadsheet Corpus by comparing a subset of the generated class diagrams with reference class diagrams created manually.
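As a rough illustration of the idea of matching a spreadsheet usage pattern and turning it into a class, the sketch below checks whether a block of cells looks like one row of labels above rows of numeric data, and if so emits a class whose attributes are the labels. This is not the authors' pattern library or their two-dimensional parsing algorithm; the names `ClassSketch`, `match_simple_table`, and the sample `grid` are hypothetical and chosen only for this example.

```python
from dataclasses import dataclass, field


@dataclass
class ClassSketch:
    """A minimal stand-in for a class in a class diagram (hypothetical)."""
    name: str
    attributes: list = field(default_factory=list)


def is_label(cell):
    # A label cell is a non-empty string.
    return isinstance(cell, str) and cell.strip() != ""


def is_data(cell):
    # A data cell is a number (bool is excluded on purpose).
    return isinstance(cell, (int, float)) and not isinstance(cell, bool)


def match_simple_table(grid, name="Table"):
    """Return a ClassSketch if grid is one label row over numeric rows, else None."""
    if len(grid) < 2:
        return None
    header, rows = grid[0], grid[1:]
    if not header or not all(is_label(c) for c in header):
        return None
    width = len(header)
    for row in rows:
        if len(row) != width or not all(is_data(c) for c in row):
            return None
    return ClassSketch(name=name, attributes=[c.strip() for c in header])


# Hypothetical sample data: a small order table.
grid = [
    ["Price", "Quantity", "Total"],
    [2.5, 4, 10.0],
    [1.0, 3, 3.0],
]
sketch = match_simple_table(grid, name="OrderLine")
print(sketch)
```

A realistic pattern library would cover many more layouts (row headers, nested tables, formulas mapped to methods); this sketch only shows the shape of a single pattern check under the stated assumptions.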

Similar resources

An Improved Model-Based Technique for Generating Test Scenarios from UML Class Diagrams

The foundation of any software testing process is test scenario generation. This is because it forecasts the expected output of a system under development by extracting the artifacts expressed in any of the Unified Modeling Language (UML) diagrams, which are eventually used as the basis for software testing. Class diagrams are UML structural diagrams that describe a system by displaying its cla...

Extracting and Semantically Integrating Implicit Schemas from Multiple Spreadsheets of Biology based on the Recognition of their Nature

Spreadsheets are popular among users and organizations, becoming an essential data management tool. The easiness to handle spreadsheets associated with the creative freedom resulted in an increase in the volume of data available in this format. However, spreadsheets are not conceived to integrate data from distinct sources and challenges arise involving systematization of processes to reuse and...

Natural Language Processing based Automatic Multilingual Code Generation

Unified modeling language is being used as a premier tool for modeling the user requirements. These CASE tools provide an easy way to get efficient solutions. This paper presents a natural language processing based automated system for generating code in multilanguages after modeling the user requirements based on UML. UML diagrams are first generated by analyzing the given business scenario pr...

Business Rule Mining from Spreadsheets

Business rules represent the knowledge that guides the operations of a business organization. They are implemented in software applications used by organizations, and the activity of extracting them from software is known as business rule mining. It has various purposes amongst which migration and generating documentation are the most common. However, apart from conventional software, organizat...

Breviz: Spreadsheet Visualization and Quality Analysis

Thanks to their flexibility and intuitive programming model, spreadsheets are widely used in industry, often for business-critical applications. Similar to software developers, professional spreadsheet users demand support for understanding spreadsheets, and analyzing their quality. This paper describes Breviz, a tool that visualizes spreadsheets as leveled dataflow diagrams, showing users all ...

Publication date: 2010